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SERVER CONSOLIDATION DATA MODEL 



CROSS REFERENCE TO RELATED APPLICATION 

[0001] The present application claims priority to U.S . Provisional Application No. 
60/455,749, filed March 19, 2003, "Discovery and Analysis of System and Database Inventories 
for Server Consolidation," which is hereby incorporated by reference in its entirety. 

COPYRIGHT NOTICE AND PERMISSION 

[0002] A portion of the disclosure of this patent document may contain material that is 
subject to copyright protection. The copyright owner has no objection to tiie facsimile 
reproduction by anyone of the patent document or the patent disclosure, as it appears in the 
Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights 
whatsoever. The following notice shall apply to tiiis document: Copyright © 2004, Unisys Corp. 

FIELD OF THE INVENTION 

[0003] The present invention relates to the field of computing systems and, more 
specifically, to systems and methods for server consolidation. 

BACKGROUND OF THE INVENTION 

[0004] As technology has become more prevalent in business organizations, 
organizations have created server farms in an ad hoc fashion. For instance, as a new appUcation 
become available or needed, organizations often add a new server to provide the computing 
support for that application. Often times, the server would have enough computing power only 
to run that particular application. Such ad hoc server farms become an unwieldy combination of 
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overlapping appUcations. multiple versions of the same application, redundant data storage and 
disparate computing power. The result is duplicate applications and incompatible hardware. In 
some cases, businesses may not even have a complete understanding of their computing 
inventory. 

[0005] Ideally, an organization's server farms would be a more homogeneous group of 
servers and applications with appUcations adequately balanced across the servers in the most 
efficient and effective way. But more tjTpically. companies have an eclectic mix of computing 
products and hardware. The result is not only an inefficient computing system but also a 
burdened staff that needs to be proficient on all of the various hardware and software 
applications. To confront the issue, organizations are consolidating theit appUcations onto 
fewer, larger servers that have increased availability and scalabiUty. 

[0006] Server consoUdation can provide significant benefits, including a reduction in the 
total cost of ownership, creation of a streamlined, manageable operation, increased system 
reUabiUty, increased capacity utiUzation, and so on. Server consoUdation can give an enterprise 
the abiUty to scale processing and storage capacity without adding physical devices or 
subsystems, as well as the flexibility to partition and aUocate resources as needed. Server 
consoUdation can lead to a standardized computing environment, reducing the number of 
platforms, consoUdating software products and system interfaces, and centraUzing operation and 
systems management procedures. The result is a reduction in staff training. 

[0007] Server consoUdation generaUy can be physical or logical consoUdation. Physical 
consoUdation extends a system's scalabiUty and logical consoUdation migrates multiple 
appUcations or databases into a centraUzed appUcation or database. In addition, Physical 
consolidation can thought of as two major sub-categories, server consoUdation and storage 
consoUdation. Physical server consoUdation takes a number of servers and places their operating 
system instances into partitions or domains of a larger server. Storage consoUdation combines 
data firom different sources into a single repository and format. Storage is one of today's most 
important asset-procurement considerations in the data center, with costs that can often rival or 
exceed server costs. Since the economic Ufe of the storage exceeds that of most servers, today's 
storage decisions vdll affect operations for years to come. 

[0008] For example, if a given server has excess capacity additional appUcations can be 
moved to that server resulting in a reduction of the overaU physical number of servers. 
Moreover, organizations typicaUy configure systems to run at 50 to 60% utiUzation, leaving the 
extra capacity for peak workloads. If this unused capacity on various servers is consider for the 
number of servers in a large server farm, the amount of wasted resources can be enormous. By 
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consolidating servers, the amount of unused capacity drops as dramatically as the number of 

servers no longer needed. 

[0009] The subject patent document describes various methods and systems for 

automiating aspects of server consolidation. 

SUMMARY OF THE INVENTION 

[0010] The above-mentioned features are provided by a system and method for 
comparing computing device parameters particularly for use in consolidating services performed 
on a plurality of computing devices such as servers in a server farm. A storage device stores 
data set files that contain information collected firom computing devices such as servers in a 
server farm. The data set files contain information indicative of the characteristics of the 
computing devices. The data set files describe the information in a markup language such as 
XML. A first relational database has tables configured to accept data from the data set files. A 
set of computer-readable instruction are capable of loading the data firom the data set files into 
the tables of the first relational database so that the two data sets can be compared to each other. 

[0011] The information indicative of the characteristics of a computing device comprises 
information indicative of system parameters, executable process parameters, or database 
parameters. 

[0012] The first relational database comprises a number of tables that maintain 
information loaded ftom the data set files. For example, a system information table maintains 
the system information for computing devices. A process table, related to the system 
information table, maintains information related to executable processes on a computing device. 
And, a module table, related to the system information table, maintains information related to 
modules on a computing device that are used by a process. Moreover, a database name table 
maintains names of computing device database names. A table table, related to the database 
name table, maintains computing device database table names. And, a column table, related to 
the table table, maintains computing device .database colunm names. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] A consolidation system and method m accordance with the invention is further described 
below with reference to the accompanying drawings, in which: 

[0014] Figure 1 illustrates an exemplary diagram of a sctvct farm consolidation; 

[0015] Figure 2 illustrates further detail of a consolidation system such as would be 
used in the consolidation in Figure 1 ; 
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[0016] Figure 3 is an exemplary user interface for invoking the discovery aspect of the 
server consolidation; 

[0017] Figure 4 is a block diagram illustrating aspects of the discovery deployment 

aspect of the system; 

[0018] Figure 5 is a high level flow diagram that illustrates the overall server 

consolidation; 

[0019] Figure 6 is an exemplary user interface showing a hierarchical folder view of 

discovered server information; 

[0020] Figure 7 is an exemplary user interface for displaying details of an appUcation 

discovered on a server; 

[0021] Figure 8 is an exemplary user interface the assists in the analysis of determining 

commonality and differences among servers in a server farm; 

[0022] Figure 9 is an exemplary user interface that provides further analysis detail on 
application commonality among servers; 

[0023] Figure 10 is an exemplary user interface for viewing servers by CPU utilization 

and memory constraints; 

[0024] Figure 11 is an exemplary user interface for selecting source and target systems 

for consolidation analysis; 

[0025] Figure 12 is an exemplary user interface that indicates results of consoUdating a 

source server to a target server; 

[0026] Figure 13 is an exemplary user interface that displays the results of the process 

analysis; 

[0027] Figure 14 is an exemplary user interface for use in database consolidation and 
provide information on conomon SQL logins; 

[0028] Figure 15 is an exemplary user interface for use in a database consolidation and 
provides information on table and colvmin compatibility; 

[0029] Figure 16A is an example of a system and application database model for use in 
analysis of system and application compatibility; 

[0030] Figure 16B is an example of a database model for use in database compatibiUty 

and consolidation analysis; 

[0031] Figure 17 is an exemplary user interface for use in deploying applications to 
computer systems in a network such as in the deployment of appUcations in a server 
consolidation; 
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[0032] Figure 1 8 is an exemplary user interface for selecting deployment rules in 
connection with application deployment; and 

[0033] Figure 19 is an block diagram illustrating the deployment of application in a 
server consolidation application. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

[0034] A detailed description of illustrative embodiments of the present invention wiU 
now be described with reference to Figures 1-19. Although this description provides detaUed 
examples of possible implementations of the present invention, it should be noted that these 
details are intended to be exemplary and in no way delimit the scope of the invention. 

[0035] Figure 1 provides an overview of a primary aspect of the subject invention. In 
general, a consoUdatipn service 1 15 is applied to a first server farm 1 10 to inventory the 
hardware, software, and data in that server farm. Aspects of that information are used to 
consolidate the server farm 1 10 into a second server farm 120. The second server farm 120 may 
represent a consolidation of the hardware, software, data, or some combination of those items. 
The consohdation service 1 15 helps to automate aspects of the consolidation through a process 
of discovering what feamres are present in the first server farm 1 10, providing an organized way 
of analyzing the discovered features to determine redundancies, utilization of resources, etc., and 
providing tools to assist in the deployment of the second, consolidated server farm. 

[0036] A typical server farm, e.g., server farm 1 10 may have a variety of servers 1 10a 
through 1 lOf. The servers 1 10a through 1 lOf in the example server farm 1 10 may be of a variety 
of manufacturers, capabiUties, power, etc. Moreover, as illustrated, the various servers contain a 
mix of applications and data. For example, server 110a runs applications App A and App B, 
server 110b runs application App Al and maintains database Data 1, server 110c runs 
application App Bl, server llOd runs appUcation App C, server llOe runs application App CI, 
and server 1 lOf runs application App D and maintains database Data 2. Notably, the various 
applications may be various versions of the same application. For example, appUcation App Al 
may be another instance of application App A, whether the same or different version. Similarly 
appUcation App B 1 may be another instance of application App B. AdditionaUy, databases Data 
1 and Data 2 may have a number of fields in conmion such that the two databases could be 
merged into a single database. 

[0037] As noted above, consoUdation service 115 provides tools to discover the various 
servers, hardware configuration. appUcations, databases, etc. contained with in server farm 110 
for the primary purpose of consoUdating the serve* farm into server farm 120. 
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[0038] Server farm 120 provides at least all of the functionaUty previously provided by 
server farm 110. unless of course some of the functionaHty was intentionally removed during the 
consolidation. In the consolidated server farm 120. hardware may be combined, eliminated, 
upgraded etc. Similarly, applications may be consolidated to run on a single server, eliminated, 
or various version of a single application upgraded and combined, e.g., applications App A and 
App Al have been consoUdated into application App A and appUcations App B and App Bl 
have been consolidated into application App B. Additionally, database Data 1 and Data 2 have 
been consolidated into database Data 1+2. 

[0039] Figure 2 further illustrates aspects of the consolidation service running on a 
consolidation management system 1 17. Consolidation system 1 17, runs on one or more 
computing devices. The computing devices are coupled to server farm 1 10 via network 210. Of 
course, showing the consolidation system 1 17 as separate from the server farm is for illustration 
purposes only. Naturally, the service could run a server or system within tiie server farm or 
without the server farm. Additionally, server farms 1 10 and 120 are shown as separate server 
farms to iUustrate the transformation that the consolidation service facilitates. In many instances, 
the server farm 120 will be an update and consolidation of server farm 1 10 itself. That is, many 
of the servers in the server farm will be reused and or redeployed in tiie consolidated server farm. 

[0040] Discovery services 202 tiiat run as part of the consolidation service comprise a 
variety of discovery services, e.g.. AppHcation/System Discovery, SQL Server Discovery, and so 
on. The various discovery services are agents that are dispensed over network 210 to discover 
and inventory the various assets in the server farm, e.g., server farm 110. The discovered 
information on the various servers, e.g., llOa-llOf, are then stored in consoUdation database 
206. After a sufficient portion of the assets on the server farm has been discovered, analysis 
service 204 can then be used to analyze various aspects of the server farm. FinaUy, the analyzed 
information can be used to manage and deploy a consolidated server farm. e.g.. server farm 120. 

[0041] Primarily, there are two types of inventory agents: System and Application Agent 
and SQL Server Discovery Agent. There could be odier agent types as well. For example, an 
agent type could be designed to gather information on Oracle databases, IBM databases. Object 
oriented databases, etc. Together these agents capture a number of data points relative to system 
hardware, appUcation and database configurations in a Microsoft Windows operating 
environment, a Unix environment, or a Linux environment. The System and AppUcation Agent 
assists in the process of retrieving those data points necessary for analyzing existing plications 
to determine their suitability for consoUdation and to assist in the design of a consoUdated 
appUcation infrastructure. System and AppUcation Agent faciUtates the capture of a detailed 
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inventory of the cUent's existing server estate, including servers, applications, databases, devices, 
processors, memory and much more including the relationships of such information as defined in 
the System and AppUcation Agent Inventory Model (described in further detail in connection 
with Figure 16A herein below); The SQL Server Discovery agent assists in the process of 
retrieving those data points necessary for analyzing existing SQL Server database 
implementations to determine their suitability for consolidation and to assist in the design of a 
consolidated SQL Server infrastructure. Although the operation of the database discovery agent 

J :u^A u^^^;^ ..,ith T^ff-rP-nr-P tn Mir.msoft SOL Server, the description and characteristics of 

the agent also apply to Oracle database systems, suitably tailored to the particular characteristics 
of Oracle systems. 

[0042] SQL Database Agent faciHtates the capture of a detailed inventory of the cHent's 
existing SQL Server estate, including servers, SQL instances, databases, users and much more 
much more including the relationships of such information as defined in the Database Inventory 
Model (described in further detail in connection with Figure 16B herein below). 

[0043] Figure 3 provides an illustrative invocation screen to set up and start the 
discovery process. Window 302 provides various user interface mechanisms to allow a user to 
control the discovery process. Folder portion 304 aUows a user to select a storage location for 
the collected discovery data, e.g., folder "/AAM/joe". Target box 306 displays the name of the 
selected target server. Box 308 displays the hst of files in the selected folder. And tools portion 
310 allows a user to select the discovery tool to use. In this example, fhe user has selected 
"Discover System." The user could have selected an alternative discovery such as 'T>iscover 
Database." 

[0044] Notably, the targets box 306 illustrates on technique for specifying a target server 
by host name. Other techniques are also possible. For example, the system 117 could accept a 
comma separated Ust of servers or the system could query the domain controller and obtain a 
subnet Ust of IP addresses in the server farm. In general, the servers could be identified by host 
name, host Ust, TCP/IP subnet, Microsoft Active Directory site name, or domain name. Host 
name enables the user to select a single server for inventory. In that instance, the user specifies 
the name of the host machine, and a user name and password with administrator privileges. Host 
Ust enables a user to select a group of servers from a host Ust for inventory. TCP/IP subnet 
enables a user to select all servers within a specific TCP/IP subnet. In that instance, the user 
enters the network subnet address and a user name and password with administrator privileges 
for aU systems in the subnet. Site name, enables a user to select aU servers in a specific site. In 
this instance, a user enters the site name and a user name and password with administrator 
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privileges for all systems within the site. Domain name enables a user to select all servers in a 
domain. The user of the discovery tool must enter the domain name and a user name and 
password with administrator privileges for all systems within the domain. After detennining the 
list of server addresses in the server farm, e.g., server farm 1 10, the system logs-in to the target 
server, e.g., 110a, and invokes the discovery process. 

[0045] In general, the user will have to login to a target server as an administrator to 
complete the discovery process. Hence, the discovery service wiU have to have access to an 
administrator account and password. This account and password will in general, but not 
necessarily be the same on all of the servers throughout the server farm, e.g., server farm 110. 
The discovery process looks up account name and password information for each system as it is 
processed. As a result, the login process can be automated to login to each of the pluraHty of 
servers llOa-llOf in server farm 1 10 using the usemame and password and thereafter invoking 
the discovery process. The discovery operation generaUy requires the organization to make 
available an existing user ID and password or create a new user ID and password for the servers 
that are targeted for discovery. The user ID should have administrator privileges, including the 
rights to debug programs and to load and unload device drivers, and can be removed from the 
systems as soon as the discovery task is completed . 

[0046] The Discovery tool launches a remote agent into each designated servers, e.g., 
1 lOa, to capture information about all of the applications and processes running in that system. 
The agent writes the captured information back to the consoUdation computer system 117 as an 
XML file, where it is stored in consoUdation database 206. The remote agent is then removed 
from the target server, e.g., 1 10a, leaving no trace of itself. 

[0047] The discovery process generaUy employs remote procedure caUs (RPC), 
interprocess communication (IPC), and named pipes to tightly couple the parent process running 
on one computing device (i.e. the computing device hosting the consoUdation system 117) with 
the server computer, e.g., 110a, that is being discovered. RPC enables appUcations to caU 
functions remotely. Therefore, RPC makes IPC as easy as caUing a function. RPC operates 
between processes on a single computer or on different computers on a network. 

[0048] Named pipes are used to transfer data between processes that are not related 
processes and between processes on different computers. TypicaUy, a named-pipe server process 
creates a named pipe with a well-known name or a name that is to be communicated to its 
cUents. A named-pipe cUent process that knows the name of the pipe can open its other end, 
subject to access restrictions specified by named-pipe server process. After both the server and 
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client have connected to the pipe, they can exchange data by peifonning read and write 
operations on the pipe. 

[0049] Discovery is the process of harvesting system information and information about 
running processes on specifieii servers located in a server farm, and storing the information in 
database 206 of Figure 2. As the discovery operation finishes on each target server, the agent is 
removed from the server and the link to the server from the external system is terminated. In 
summary, no trace of the discovery operation should remain in the organization's system. 

[0050] Multiple discoveries can be done by scheduling discovery at specific time 
intervals to capture those applications or processes that run only at a particular time or the 
discovery operation can be run again manually. Each time the discovery operation is repeated, a 
new revision of the server XML file is created. All revisions are stored and available in the 
version history. 

[0051] The type of information discovered by Application and Process Discovery 
includes hardware information, such as the number of processors on a given system, available 
processors on a given system, processor level and revision, devices, disk drive characteristics and 
capacities, as so on. System information discovered includes system name, page size, operating 
system version, operating system build, network coimectivity, and so on. Process and 
dependency information discovered includes active processes and their associated dependencies 
(both component and configuration), processor usage at both the system and the process level, 
memory usage at both the system and the process level, process creation time, process ID, 
process owner, process handles, process and dependency versions and timestamps, process and 
dependency descriptions. 

[0052] SQL Server Database discovery is designed to facilitate SQL server 
Consolidation. It automates much of the information gathering and analysis process. It 
complements the information gathered through Process discovery. The information gathered is 
a detailed inventory of the customer's existing SQL Server estate - Servers, Instances, Databases, 
User and so on. The information collected is stored in database 206 and is used by consolidation 
system 117 during the analysis process. 

[0053] Figure 4 further illustrates aspects of the discovery process. The target server, 
e.g., 110a, is preferably selected throu^ a GUI interface as part of the overall discovery process. 
A selected discovery agent 406 is pushed on the target server with a privileged user account and 
starts collecting information into an XML file format on client machine. The XML file is stored 
in consolidation database 206 with a tracking version. As part of the load process, the 
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infonnation in the XML file is read and transformed into a series of relational records and stored 
in a cache database for query purposes. 

[0054] The consoUdation database 206 is used to store the information coUected from 
target SQL servers. The database type is preferably a relational database. In addition and not to 
be confused with consolidation database 206. there are target databases, e.g., target SQL server 
databases: Such databases are the instances where the inventory is taken from. To access these 
databases, the database discovery process requires SQL admin privileges account on the target 
SQL server. 

[0055] To connect to an instance of SQL Server, typicaUy two or three pieces of 
information are required, including the network name of the computer on which the SQL Server 
instance is running, and the instance name (this is necessary in the case where only a particular 
instance is to be discovered). 

[0056] Initially, after login, consolidation system 117 copies a procedure over to the 
target server, e.g., UOa. Li particular, it copies a remote service executable program 404 to 
admin$ share on the server computer. Thereafter, four named pipes 402 are started up as shown 
in Figure 4 between the remote service 404 and consolidation system 1 17. The four named pipes 
402, stdin, stdout, stderr, and contirol are used to facilitate communication between the 
consolidation system 1 17 and the server 1 10a. The remote service 404 establishes the 
connection between consolidation system 117 and server UOa using the named pipes 402. After 
the named pipes 402 have been established, a discovery procedure 406. e.g.. the discovery 
procedure selected fix)m the tools box 310 in Figure 3, is copied to sever UOa. 

[0057] When the discovery process 406 is in place on target server UOa, the contirol pipe 
is used to run discovery procedure 406. The named pipes 402, i.e. stdin, stdout, stderr, and 
conti-ol are routed to the discovery procedure. The discovery process 406 then performs the 
appropriate inventory collection, as described more fully below, and sends back an XML file that 
includes the data describing the assets on target server UOa. Thereafter, the discovery process 
406 terminates and then is preferably shut down and also removed from target server UOa. the 
process is then repeated for the remaining servers in the server farm 110, e.g., UOb, UOc. and so 
on. 

[0058] When the AppUcation and System discovery agent starts on the target server 
1 10a, the processes and DLLs information is collected using various system calls. To obtain a 
list of all processes in a Windows 2000 Server operating system environment, the following calls 
are used: ^^ 
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ULONG ( ^stdcall *NtQuerySystemInformation)( 

ULONG SystemlnformationClass, 
PVOID Systemlnformation, 
ULONG SystemlnformationLength, 
PULONG ReturnLength 

); 

[0059] NtQuerySystemlnformation is an internal Windows function that retrieves 

various kinds of system information. 

[0060] SystemlnformationClass indicates the kind of system information to be retrieved. 
The information includes: the number of processors in the system., information about the 
resource usage of each process, including the number of handles used by the process, the peak 
page-file usage, and the number of memory pages that the process has allocated. 

[0061] Systemlnformation points to a buffer where the requested information is to be 
returned. The size and structure of this information varies depending on the value of the 
SystemlnfonnationClass parameter: 

[0062] SystemlnformationLength is the size of the buffer pointed to by the 
Systemlnformation parameter, in bytes. 

[0063] ReturnLength is an optional pointer to a location where the function writes the 
actual size of the information requested. 

[0064] Another call is used that provides a starting address to obtain the information 
about what DULs are loaded by a process. That call is as follows: 

ULONG ( stdcall *NtQueryInformationProcess)( 

PVOID ProcessHandle, 
INT ProcessInformationClass, 
PVOID Processlnformation, 
ULONG ProcessInformationLength, 
PULONG ReturnLength ); 

[0065] ProcessHandle specifies the handle to the process for which information is to be 

retrieved. 

[0066] ProcessInformationClass specifies the type of process information to be retrieved. 
This parameter can either retrieves a pointer to a FEB structure that can be used to determine 
whether the specified process is being debugged, and a unique value used by the system to 
identify the specified process or whether the process is running in the WOW64 environment 
(WOW64 is the x86 emulator that aUows Win32-based applications to run on 64-bit Windows). 

[0067] PiDcessInformation is a Pointer to a buffer supplied by the calling application into 
which the function writes the requested information. 

[0068] ProcessInformationLength is the size of the buffer pointed to by the 
Processlnformation parameter, in bytes. 
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[0069] RetumLength is a pointer to a variable in which the function returns the size of 

the requested information. 

[0070] The information so coUected is then put into an XML file and transmitted back to 
consolidation computer system 1 17. The below XML provides an example of a portion of such 

an XML file. 

<?xml version="1.0" encoding="ISO-8859-l" ?> 

<Discovery type="Process"> ^ „^ « 

<PE Syslnfo ID="5008DJUL1030-sr discoverVersion= 2.0.0 „,„^^^^„_-_„ 
captureTimeGMT="21:10:30 30 Oct 2003" captureTimeNumeric= 1067548230 
systemName="USMV-MUTSCHGO" systemMake="Dell Computer Corporation 
systemModel="Dell WORKSTATION PWS360" osMajorVersion="5" 
osMinorVersion="l" osBuild=''2600" osRev= "Service Pack 1" pageSize= 4096 
allocationGranularity="65536" totalMemory="1072689152" 
availableMemory="634216448" totalVirtualMemory="2147352576 
availabIeVirtualMemory="2111578112" totalPageFile="2581708800J 
availablePageFile="2110324736" memoryLoad="40" cpuLoad= 1.7 
systemDirectory="C:\WINDOWS\System32V'> . „ , • 

<PE_SysInfoEx ID="EI" parent_ID="5008DJUL1030-sr' servicePackMajor= 1 
servicePackMinor="0" productType="PROD_WORKSTATION" 
installedPkg="PKG_SINGLEUSERTS" /> 

<PE_HdweInfo ID="Hr parent_ID="5008D3UL1030-SI" numberOfProcessors= 2 

availableProcessorMask="3" processorLevel="15" processorRevision="521 > 
<PE ProcessorSpeed parent_ID="HI" procNum="0" speed="2992" /> 
<PE_ProcessorSpeed parent_ID="HI" procNum="l" speed="2992" /> 

<PE_Device parent_ID="HI" deviceLocation="LPTl" cmpLocation=""> 
<deviceName>Printer Port Logical Interfece</devlcelSlame> 
</PE_Device> „ , ^. ,„. 

<PE_Device parent_ID="HI" deviceLocation="USB Device" cmpLocation= > 
<deviceName> ViewSonic Color Pocket PC V37</deviceName> 

</PE_Device> „« « n i « 

<PE_Device parent_ID="HI" deviceLocatlon="" cmpLocation= 0,0,0,1 > 
<deviceName>HL-DT-ST RW/DVD GCC-4480B</deviceName> 
</PE_Device> 

'<PE AppCatalogltem parent_ID="5008D3UL1030-sr' appName="PowerDVD" 
app\7ersion="" pubHsher="" msiGuid="{6811CAA0-BF12-llD4-9EAl- 
0050BAE317E1}"> 

<installLocation /> 

<installSource /> 

</PE AppCatalogItem> „ ^ u c 

<PE AppCatalogltem parent_ID="5008DJUL1030-SI" appName="Easy CD Creator 5 
Basic" appVersion="5.3.4.21" publisher="Roxio Inc" msiGuid="{609F7AC8-C510- 
11D4-A788-009027ABA5D0>"> 

<installLocation /> 

<installSource /> 

</PE AppCatalogItem> . ^ ^„ 

<PE AppCatalogltem parent_ID="5008DJUL1030-SI" appName="Microsoft Office 
2000 SR-1 Premium" appVersion="9.00.9327" publlsher="Microsoft Corporation 
msiGuid=''-(00000409-78El-llD2-B60F-006097C998E7>"> 

<installLocation /> . ^ „^ 

<insta!ISource>\\usmv-sms\UITSoftware\STD2000.S2A\</mstallSource> 

</PE_AppCatalogItem> , 
<PE_AppCatalogItem parent_ID="5008DJUL1030-sr appName= Microsoft SQL 
Server 2000" appVerslon="8.00.751" publisher="Microsoft" msiGuid=" > 

<installLocation>C:\Program Files\Microsoft SQL Server\MSSQL</installLocation> 
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<lnstallSource /> 
</PE_AppCata!ogItem> 



<PE_Process ID="Proc.l588" parentSystem_ID="5008DJUL1030-sr 
processName="AGENTSRV.EXE" processId = "1588" depth = "5" affinltyNlask="3" 
processOwner="NT AUTHORJTY\SYSTEM" parentProcess_ID="Proc.772" 
startT1me="09:21:25 29 Oct 2003" startTimeNumeric="1067448085" 
handIeCount="119" ba5ePriority="8" cpuTime="1441406250" 
percentCpuTime="0.1" hasServices="true" ownProcess="false" 
peakVirtualSize="187858944" pageFaultCount="127666" 
peakWorkingSetSize="107339776" workingSetSi2e=" 184320" 
q.^o|-3Pe3i,p3gedPoo!Usage="51872" quotaPagedPoolUsage="50056" 
quotaPeakNonPagedPoolUsage="20446" quotaNonPagedPoolUsage="4400" 
pagefileUsage="18952192" peakPagefileUsage="107180032" 

privatePageCount="18952192" verslon="7.0.3.0892" usedModules="Mod.0 Mod.l 
Mod. 2 Mod. 3 Mod. 4 Mod. 5 Mod. 6 Mod. 7 Mod. 8 Mod. 9 Mod. 10 Mod. 11 Mod. 12 
Mod!l3 Mod. 14 Mod. 15 Mod. 16 Mod. 17 Mod. 18 Mod. 19 Mod. 20 Mod.21 Mod. 22 
Mod. 23 Mod. 24 Mod. 25 Mod. 26 Mod. 27 Mod. 28 Mod. 29 Mod.30"> 

<descriptron> Agent Service ModLjle</description> 

<fullPath>C:\Legato Connected\AGENTSRV.EXE</fullPath> 

<commandLine>"c:\Legato Connected\AgehtSrv.EXE" -asv</commandLine> 

</PE_Process> 

<PE_Process ID="Proc.772" parentSystem„ID="5008DJUL1030-SI" 
processName="SERVICES.EXE" processId="772" depth="4" afflnityMask="3" 
processOwner="NT AUTHORITYXSYSTEM" parentProcess_ID="Proc.728" 
startTime=="09:21:21 29 Oct 2003" startTimeNumeric="1067448081" 
handleCount="365" basePriorlty="9" cpuTime="79843750" percentCpuTime="0.0" 
hasServices="true" ownProcess="false" peakVirtualSi2e="54595584" 
pageFaultCount="4985" peakWorkingSetSize="7499776" 
workingSetSize="4673536" quotaPeakPagedPoolUsage="58560" 
quotaPagedPoolUsage="35612" quotaPeakNonPagedPoolUsage="14264" 
quotaNonPagedPooIUsage="11040" pagefileUsage="3964928" 
peakPagefileUsage="4517888" prlvatePageCount="3964928" version="5.1. 2600.0 
(xpcllent.010817-1148)" usedModules="Mod.217 Mod.l Mod. 2 Mod.9 Mod. 5 Mod. 6 
Mod. 3 Mod. 4 Mod. 66 Mod. 218 Mod.212 Mod.219 Mod.84 Mod. 220 Mod. 73 Mod. 221 
Mod. 17 Mod. 18 Mod. 50 Mod. 37 Mod. 89 Mod. 65 Mod. 19 Mod. 222 Mod. 30 Mod. 223 
Mod. 7 Mod.42"> 
<description>Services and Controller app</description> 
<fullPath>C:\WINDOWS\SYSTEM32\SERVICES.EXE</fullPath> 
<commandLine>C:\WINDOWS\system32\services.exe</cornmandLlne> 

</PE_Process> 



'<PE_Module version="l. 02.0814.0000" ID="Mod.392" parent_ID="5008DJUL1030- 
sr base="1505034240" size="36864" memoryMapped="false" 
creationTlme="8/29/2002 2:00 AM"> 

<moduleDescription>WinInet Soap Connector Ubrary</nnoduleDescnption> 

< path > C:\Prog ram Files\Common Files\MSSoap\Blnarles\WISC10.DLL</path> 

<lmageName>WISC10.DLL</lmageName> 

</PE_Module> 

</PE_SysInfo> 
</Dlscovery> 

[0071] When the SQL Server discovery agent starts on the target server 1 10a, the 
following actions are performed: 
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1. The agent captures the SQL Server name and version on the target machine 1 10a. 

2. For each instance of the SQL Server on target machine 1 10a, the f oUowing 

information is captured: 

# The database schema's present is determined, and for each database schema 
information is coUected such as tables, views, indexes, roles, etc. 
User logins, permissions and roles 
User objects in the master db 

Database names and logins and database client names 
SQL configuration settings 
Collation settings 
Jobs and tasks 
SQL alerts 
Replication 
DTS packages list 

Database size and log size information 
[0072] In general, the captured data is used to detect differences between database 
objects for duplicate databases on multiple servers. The foUowing database objects are captured 
for comparison: 

[0073] Roles, Users, AUases, Defaults, Rules, Functions, User defined data types. User 
messages. Tables. Views, Indexes, Extended procedures, Stored procedures and Triggers. There 
are several methods available to capture this information. The preferred method uses T-SQL 
and collects the catalogue information from system tables. The below description illustrates an 
implementation for SQL Server available fix)m Microsoft Corporation. Nevertheless, the overall 
technique is also appUcable to other database systems such as Oracle database systems. 

[0074] SQL Server available system stored procedures are used to capture information. 
For example, a join query against Syspiocesses and sysdatabases tables captures some of the 

information as follows: 

SELECT dbs.[name], [program_name],[loginame] FROM .»,horo 
[master].[dbo].[sysprocesses] procs, [master].[dbo].[sysdatabases] dbs Where 
procs.[dbid] = dbs.[dbid] And Len([program_name]) > 0 

[0075] The function interrogates Master db for any user objects. System Stored 
procedures are used to capture the data. The function looks for user type objects in the master 
database and the ones found along with their description and contents is written to XML file to 

be stored in the cache database. 

SELECT CONVERT(char(32), host_name()) as MachineName, 
ServerName = CASE @@servername WHEN null THEN CONVERT(char(32), 
host nameO) ELSE CONVERT(char(32), @@servername) END, o.name as 
StoridProcName, u.name as OwnerNarpe FROM master..sysobjects o, 
Sr sysusers u WHERE o.uid = u.uld and o.type = 'P' and o.category = 

0 and o.name <> 'sp.helpsql' 
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[0076] To identify the potential login problems like duplicate names in more than one 
server and the conflicting permission, this function captures the logins and permissions via the 
stored procedures available. 

[0077] For each instance get the list of logins and their roles for each database within 

that instance. ' 

[0078] The configuration information such as from sp_configure, is extracted and 
compared against the default settings for a particular version of SQL Server. 

[0079] SQL Server function ServerProperty is used to collect product version, edition, 
service pack, collation, etc. as illustrated below: 

Select CONVERT(char(32), serverpropertyCCollatlon*) )as •Collation', 
CONVERT(char(32), serverpropertyCEditionO)as 'Edition', 
CONVERT(char(32), serverpropertyCEngine Edition*) )as 'Engine Edition', 
CONVERT(char(32), serverpropertyCInstanceName') )as InstanceName', 
CONVERT(char(32), serverpropertyrisClustered'))as IsClustered', 
CONVERT(char(32), serverproperty('IsFullTextInstalledO)as 'IsFullTextlnstalled', 
CONVERT(char(32), serverproperty('IsIntegratedSecurityOnly'))as 
IsIntegratedSecurityOnly', 

CONVERT(char(32), serverpropertyCIsSingleUser'))as 'IsSingleUser', 
CONVERT(char(32), serverpropertyCIsSyncWithBackup'))as 'IsSyncWithBackup', 
CONVERT(char(32), serverpropertyCLicenseType'))as 'LicenseType', 
CONVERT(char(32), serverproperty('MachlneName'))as 'l^achineNanne', 
CONVERT(char(32), serverproperty('NumLicenses'))as 'NumUcenses', 
COfs!VERT(char(32), serverpropert^'CProcessID'))as 'ProcessID', 
CONVERT(char(32), serverproperty('ProductVersion'))as 'ProductVersion', 
CONVERT(char(32), serverpropertyCProductLever))as 'ProductLevel', 
CONVERT(char(32), serverpropertyCServerName'))as 'ServerName' 
For non-2000 SQL Server some of these fields will be null. 

[OGSO] The below functions captures lists of Jobs, via sysjobs table of msdb. Alerts via 
. sys Alerts table and Operators via sysOperators for an Instance. 
Jobs: 

Select CONVERT(char(32), host_name()) as MachineName, ServerName = CASE 
@@servername WHEN null THEN CONVERT(char(32), host^nameQ) ELSE 
CONVERT(char(32), @@servername) END,* fronn msdb.. sysjobs 

Alerts: 

SELECT CONVERT(char(32), host_name()) as MachineName, ServerName = CASE 
@@servername WHEN null THEN CONVERT(char(32), host_name()) ELSE 
CONVERT(char(32), @@servername) END,[id],[Name],Event_source, 
Event_categoryJd, Eventjd, Message_id, Severity, 

Enabled, Delay_between_responses, Last_occurrence„date, Last_occurrence_time, 
Last_response„date, Last_response_tlme, Notification_message, 
Include„event_description, Database_name, Event„description_keyword, 
Occurrence^count, Count„reset_date, Count_reset_time, Jobjd, Has.notification, 
Flags, Performance_condition, Categoryjd, " as Event_category_name, " as 
Delay_between„notifications, " as Task_4d/ " as Has_emaiLnotification, " as 
Has_pager_notification FROM msdb.-sysalerts 
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Operators: 

DECLARE @SQLVersion varchar(4) 

SELECT ©SQLVersion = SUBSTRING(@@version, 23, 4) 
—Extract the information, dependant on SQL version 
IF (@SQLVersion = '6.50') 

SELECT CONVERT(char(32), host_name()) as MachineName, ServerName = 
CASE @@servername WHEN null THEN CONVERT(char(32), host_name()) ELSE 
CONVERT(char(32), @@servername) END,*, " as Netsend^address, " as 
Last_netsend_date, " as Last_netsend_time, " as Categoryjd FROM 
msdb..sysoperators 
ELSE 

IF f®SOLVersion = '7.00') or (@SQLVersion = '2000') 

SELECT CONVERT(char(32), host_name()) as MachineName, 
ServerName = CASE @@servername WHEN null THEN CONVERT(char(32), 
host_name()) ELSE CONVERT(char(32), @@servername) END,* FROM 
msdb..sysoperators 

[0081] Where replication is allowed, information is collected on databases and reported 

in a list, server, instance and dbnames along with replication role (Publisher, Distributor, 

Subscriber) and replication type. The system Store procedure 'sp.helpreplicationdboption' is 

utilized to capture replication information. To capture DTS packages info, the following SQL 

statements are exercised: 

DECLARE @SQLV/ersion varchar(4) 
DECLARE @SQLString varchar(255) 
SELECT @sqlversion = SUBSTRmG(@@verslon, 23, 4) 
IF (@SQLVersion = '6.50') 
select " 

ELSE 

IF (@SQLVerslon = 7.00') 

IF @@ServerName is not Null 

SELECT @SOLString = 'SELECT CONVERT(char(32), 
host_name()) as MachineName, CONVERT(char(32), @@servername) as 
ServerName,name,id,versionid,cast(descriptlon AS char(25)) as ShortDescription, 
categoryid,createdate,owner, owner_sid, as PackageType from 
msdb..sysdtspackages' 
ELSE 

SELECT @SQLString = 'SELECT CONVERT(char(32), 
host„name()) as MachineName, CONVERT(char(32), host_name()) as 
ServerName,name,id,versionid,cast(description AS char(25)) as ShortDescription, 
categoryid,createdate,owner, owner_sid, as PackageType from 
msdb-.sysdtspackages' 
ELSE 

IF ©SQLVerslon = '2000' 

IF @@ServerName is not Null 

SELECT @SQLString = 'SELECT CONVERT(char(32), 
host_name()) as MachineName, CONVERT(char(32), @@servername) as 
ServerName,name,id,versionid,cast(description AS char(25)) as ShortDescription,^ 
category id, createdate,owner, owner_sid,packagetype from msdb..sysdtspackages' 

EISE 

SELECT ©SQLStrlng = 'SELECT CONVERT(char(32), 
host_name()) as MachineName, CONVERT(char(32), host_name()) as 
ServerName,name,id,versionld,cast(description AS char(25)) as ShortDescription,^ 
categoryid,createdate,owner, owner„sid,packagetype from msdb..sysdtspackages' 
EXEC(@SQLStrlng) 

-17- 



wo 2004/086186 



PCTAJS2004/008498 



[0082] In order to get the database size and log size for each database dbsize (used and 
ftee), and logsize (used and free) are used and reported with server/instance/dbname. The below 
is sample code to go to each database and execute stored procedure 'sp.spaceused' to capture 

some of the information. 

DECLARE AllDatabases CURSOR FOR SELECT name FROM sysdatabases -WHERE 
dbid > 4 

OPEN AllDatabases 

DECLARE @DBNameVar VARCHAR(128) 

DECLARE ©Statement VARCHAR(255) 

FETCH NEXT FROM AllDatabases INTO @DBNameVar 

WHILE (@@FETCH_STATUS = 0) 

BEGIN 

SELECT ©Statement = 'USE ' + @DBNameVar + CHAR(13) 

+ ' exec sp_spaceused' 
EXEC (©Statement) 

FETCH NEXT FROM AllDatabases INTO ©DBNameVar 
END 

CLOSE AllDatabases 
DEALLOCATE AllDatabases 

[0083] To capture log size information, the following SQL statement is used: 
DBCC SQLPERF(LOGSPACE) WITH NO_INFOMSGS 

[0084] The database infomation captured is formatted into an XML file and transmitted 
back to the consolidation system 1 17. An example portion of such and XML file is as foUows: 
<?xml verslon="1.0" encoding="ISO-8859-l" ?> 

<Discovery type="Database"> „ . ^ w . »c i ->t^nn 

<DD_Server machineName="USMV-VAZEHGMMl" windowsVersion= 5.1.2600 
Service Pack 1 Build 2600" discoveiVersion="2.0.0" processorCount="l 
processorActiveMask='"' OS_Name="Windows_NT 5" systemName="USMy- 
VAZEHGMMl" systemManufacturer="DelI Computer Corporation" systemModel- Dell 
OPnPLEX GX260" systemType="x86" processor="x86 Family 15 Model 2 Stepping 4 
Genuinelntel ~ 2000 MHz" BIOSVersion="DELL - 6" locale="United States" 
timeZone="Padfic Standard Time" windowsDirectory="C:\WINDOWS" 
bootDevice="\Device\HarddiskVolume3" systemDirectory="C:\WINDOWS\System32 
physicalMemory="1046524.00" availablePhyslcalMemory="102700.00" 
virtualMemory="2097024.00" avallableVirtualMemory="2D40440.00" 
pagefileSpace="0.00"> 



<DD Database serverName="USMV-VAZEHGMMl\DESKTOPSERVER" 
dbName="Analysis" owner="sa" created="Sep 3 2003" status="Status=ONUNE, 
Updateability=READ_WRITE, UserAccess=MUL-n_USER, Recovery=SIMPLE, 
Version=539, Collation=SQL_Latinl_General_CPl_CI_AS, SQLSortOrder=52, 
IsAutoClose, IsAutoShrink, isTornPageDetectionEnabled, IsAutoCreateStatlstics, 
IsAutoUpdateStatistics" compatibility_level="80" logSize="0.00" 
logspaceUsed="0.00" IStatus="" dbSlze="24.06" unalloc_s="1.91" 
reserv s="21664.00" data_s="15552.00'' index_s="5736.00" unused_sp="376.00 
transPublish="0" mergePublish="0" dbOwner="True" readOnly="False"> 

<DD_SchemaInfo> 
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<DD_Table serverName="USMV-VAZEHGMMl\DESKTOPSERVER" 
dbName="Analysis" tableName="DD_Alert"> 

<DD_Column serverName="USMV-VAZEHGMMl\DESKTOPSERVER" 
dbName="Analysis" tabIeName="DD_Alert" columnName="fileVerslon" colid="l" 
coltYpe="nvarchar" collen="510" colprec="255" colscale="0" isnullable="0" 
collation = "SQL_Latinl_GeneraLCPl_CI_AS" /> 

</DD_Table> 

<DD_User serverName="USMV-VAZEHGMMl\DESKTOPSERVER" 
dbName="Analysls" loginName="" groupName="" userName="guest" /> 
</DD_SchemaInfo> 
</DD_Database> 

</DD_Instance> 

</DD_Server> 

</Dlscovery> 

[0085] Here is a more detailed XML layout for the Schema information part only. 
[0086] For each database within an SQL instance, there is an element called 
<SchemaInfo> containing the information. 



<SchemaInfo 
<TableInfo 

<ColumnInfo name = ''columnName goes here" 

Description = ''column description goes here" /> 
<ColumnInfo name = ''columnName goes here" 

Description = ''column description goes here" /> 
<ColumnInfo name = ''columnName goes here" 

Description = '"column description goes here" /> 



more columns 

<TriggerInfo name = "trIggerName " Description = " trigger description" /> 



.......additional triggers 

<ConstraintInfo name = ''constraintName " 

Description = "constraint description" /> 



additional constraints 

<IndexInfo name = ^indexName ^ Description = ^Mndex description" /> 



additional indexes 

</TableInfo> 



additional tables go here 

<ViewInfo name = 'ViewName goes here" 

Description = "'view description goes here" </ViewInfo> 



more views 

<UdtInfo name = ''UDTName goes f^ere" 

Description = "UDT description </UdtInfo> 
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more user-defined types 

<FunctionInfo name = ''functionName goes here" ^,r=..«^i«r,Tr.f«^ 
Description = "function description goes iiere </FunctionInfo> 

more user-defined functions 

<SPInfo name = "stored-procedureName goes here" ^ ^/coiof^-. 

Description = ''Stored-procedure description goes here </SPinro> 

more user stored-defined procs 

<DefaultsInfo name = "defaultName goes here" ^,r^«^:=.Ml»■Tnfo^ 
Description = "default description goes here </DefaultInfo> 

more defaults in here 

<RuleInfo name = "ruleName goes here" . ^ * ^ 

Description = "rule description goes here" </RuleInfo> 

more rules in here 

<UserInfo name = "userName goes here" 

Description = "user description goes here" </UserInfo> 

more user info in here 

<User|V|sgInfo name = "userMsgName goes here" ^„ ,o«,.McnTnfn> 

Description = "userMsg description goes here" </UserMsgInfo> 

more user messages info in here 

</SchemaInfo> 

[0087] After the information for a particular server has been discovered, the process is 
repeated for another server, e.g.. 110b, until aU of the servers of interest in a server farm, e.g., 
1 10, have been discovered. After a sufficient number of the servers has been discovered, and 
more likely after a substantial number of the servers have been discovered, the analysis tools can 
be used to assist in aspects of the consolidation process. 

[0088] Analysis tools interpret and generate reports from the information obtained 
during the discovery process. Any of the discovery files can be opened, mcluding revisions of 
each file. Thus, the analysis process can be tailored to focus on any subset of discovered server 
assets. Once die set of discovery files are opened, the analysis tools summarizes the number of 
systems and processes being analyzed. 

[0089] Although the analysis is described herein below in the context of server 
consoUdation wherein the appUcations, databases, etc. are move to one or more other target 
servers, the analysis aspects and indeed many of tiie tools described herein also apply to a single 
server That is, aspects of a server can be compared to itself at different points in time. Hence, it 
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is important to note that the discovered XML files described above are maintained by server by 
time. This allows two forms of time-based analysis. In one case, the processes in use and 
system loading for a server can be examined as they change over time. In the other case, a server 
can be compared to itself after consolidation activities have occurred. That will aDow a 
consolidation to be rolled back. For instance, if an application and its dependencies were moved 
from a source server to a consolidation target server and the application and some or all of its 
dependencies were subsequently removed from the source server, the analysis tools described 
herein will allow aU of the features to be applied in comparing one version of a server's 
inventory to a different version of the same server's inventory. In that way, a user can revert 
back to an early system state. Similarly, the system could be used to track what inventory was 
added to a particular server and at what version the additions were made. In this way, the 
analysis tool may allow a user to quickly identify which applications were added to a server that 
may have caused it to exceed utiUzation criteria. The important point is that the tools described 
herein apply to other contexts than the context of comparing a source server to a target server for 
the purpose of consolidation. 

[0090] Reports that highlight opportunities for application consolidation and application 
coexistence can be generated. For example, the Conamon Processes report lists the processes 
running on two or more systems within the server farm. Applications associated with conamon 
processes are consolidation candidates. The analysis tools provide custom report output, sorted in 
any manner, on any stored attribute, 

[0091] Reports can be generated based on queries of any of the following data elements: 

© Hardware Information 
• Number of processors on a given system 
Available processors on a given system 
Processor level and revision 
Devices on a PCI bus 

Non-network disk drives on a system and characteristics of the drives 
System Information 
System name 
Operating system version 
Operating system build 
Total and available memory 
Applications 
Application name 
Application version 
Processes 

Process name and process ID 
Process owner 
Process dependencies 
Process and dependency descriptions 
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• Process and dependency versions and timestamps 

• Actual memory and virtual memory 

• Memory paging 

• Processor usage 

• Actual CPU time 

• Number of handles open on a process 

[0092] Figure 5 provides a flow chart of the general process involved in analyzing the 
collected data for the purpose of consolidation. The figure uses the example of appUcation 
consolidation. Nevertheless, a very similar process will happen for data consoUdation. 
Obviously, if all of the applications and data on a given server are consolidated to other servers, 
that server is a candidate for removal from the server far altogether, resulting in a physical 
consolidation. 

[0093] Initially, a determination is made whether data has been discovered for a server or 
servers of interest (step 502). An initial high level analysis is made to determine potential 
consolidation candidate servers (step 504. 506). This process is described more fully below in 
connection with the analysis user interface figures. At step 508. a determination is made 
regarding the potential benefit of a consolidation. If there is a potential benefit, then all of the 
necessary data for consolidation is collected (step 510). This may already have happened, if so 
that step can be skipped. However, all of the detailed information necessary for consoUdation 
should be available such as an application and all of its dependent modules, or a database and all 
of its tables and columns (step 512). Thereafter, an analysis is perfonned to determine the 
common components on the candidate servers, e.g., the number of appUcations and modules that 
are common between the candidate servers. Next a Ust of potential consolidation groupings are 
made, e.g.. the e-mail applications can be grouped together on one machine (steps 514. 516). 
After the candidate appUcations and/or databases are identified, the dependencies are compared 
for variations, e.g., is the DLL on one candidate server the same version as a DLL on the other 
server (steps 518. 520). After the appUcations and/or databases have been consoUdated, 
performance values of the consoUdated server.are measured to ensure that it has the capacity to 
perform the added tasks (steps 522. 525). Thereafter, the entire process can be repeated and new 
information discovered for the consoUdated server farm to determine whether fiather 
consoUdation is beneficial. 

[0094] Figure 6 provides an illustration of an exemplary user interface (UI) for use in 
consoUdation analysis. Window 600 provides an interface for users to browse through the 
various files of discovery information coUected from the servers in the server farm of interest, 
e.g., 110. To that end Window 600 has a pane 602 with a hierarchically arranged catalog of 
server information arranged into folders. By selecting one of the folders, displayed in pane 602. 
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the user is presented in pane 604 with a catalog of the XML files (described above) that have 
been collected from the various servers. Notably, each of the XML files contains a time stamp 
606 and version number 608. That allows information to be discovered on the same server at 
different times and to monitor server changes. 

[0095] Figure 7 depicts an example of a portion of the UI that assists in the analysis of 
server consolidation by allowing a user to view all of the inventory of discovered servers. 
Window 700 is divided into two panes 702 and 703. Pane 702 provides a hierarchical view of 
the discovered information for a sen/er. Here for example, a user has opened a hierarchical view 
of the system inventory for server OTG-SYS-3 and has selected Applications and Adobe 
Acrobat 5.0 (704) in particular. The attributes 706 and corresponding values 708 for that 
application are displayed in pane 703. 

[0096] Figure 8 depicts an example of a portion of the UI that assists in the analysis of 
server consolidation by presenting a graphic of the eonunonality of applications on selected 
servers. Window 800 provides a view of three pie charts 802, 804, and 806. Pie chart 802 
graphically depicts the applications that appear on more than one server with those applications 
that have different and the same versions appearing in different colors or shading. Here for 
example, pie chart 802 shows that there is a very high commonality of applications on selected 
servers, suggesting that benefits may be gained through consolidation. Similarly, pie chart 806 
indicates the amount of conmionality of process and shows a high commonality in this example. 
Pie chart 804 provides a graphic depiction of the commonality of process dependencies in the 
servers of interest. The details of the commonality can be viewed in more detail as shown in 
Figure 9. 

[0097] Figure 9 provides an example portion of the UI that provides further details on 
process commonality. Window 900 is divided into two panes 902 and 904. Pane 902 provides a 
listing of the servers in the server farm to undergo consolidation analysis, e.g., server farm 1 10. 
Pane 904 provides a list of processes by process name 906. Pane 904 also shows which server 
the process 908 is on, along with the discovery information revision 910. From this window 
900, a user can further analyze candidate servers for consolidation by determining which servers 
are ranning key processes in common. 

[0098] Additional analysis functions provide an indication of memory and processor 
loads and assist in identifying servers that are underloaded or overloaded. Servers that are 
underloaded may be candidates to have their applications consolidated on to another server. 
Additionally, servers that are already overloaded are not good candidates to accept additional 
applications in a consolidation and may, in fact, benefit from have one or more of its applications 
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moved to another server. Figure 10 provides an example UI to display CPU and memory 
utilization. Window 1000 has two panes 1002 and 1004. Pane 1002 provides a hierarchical 
listing of server inventory. Pane 1004 provides a display showing the combined average CPU 
and memory utiUzation for servers in the system and help with compatibility analysis. Bar 1006 
provides a graphic indication of the CPU and memory load on a particular server and has a 
portion 1006a that indicates CPU load and a portion 1006b that indicates memory load. SUdes 
1008 and 1010 provide a mechanism by which a user can filter the results, i.e., by setting the 
slide 1008 a user can exclude those systems from the display whose minimum CPU utilization is 
less than the threshold set by the sUder and by setting the sHde 1010 a user can exclude those 
systems whose CPU utilization exceed the maximum CPU utiHzation threshold set by the slider. 
Similarly, slides 1012 and 1014 allow a user to filter on memory utilization by setting the 
minimum and maximum thresholds. The filter allows a user to quickly identify source servers 
that are candidates for consolidation. The Min uptime hours spin box 1016 can be changed to 
exclude those systems from the display whose time of operation since the last restart is less than 
the number of hours indicated. 

[0099] Figure 11 provides further details on the analysis tools provided for server 
consolidation. Here Window 1110 provides two panes 1 102 and 1 104. Pane 1 102 Usts all of the 
servers in the server farm, e.g., 1 10 that have been discovered by the System and Application 
discovery tool. Pane 1104 provides a mechanism for a user to select process or system 
compatibility by way of radio buttons 1 104 and 1 106. In this example, the user has selected 
system compatibility analysis. Thereafter, a use can select a source system 1108, e.g., a server 
candidate for consoUdation and one or more target systems 1110. Source system processes are 

display in box 1112. 

[0100] Figure 12 further details the analysis by display indicators of the result of 
consoUdating the source server to the target server. Window 1200 provides the results of the 
selections made in Window 1100 as shown in Figure 11. Window 1200 displays the results of 
consolidating selected source server OTG-TEST-SRV3[1.2] on to target server OTG-TEST- 
SRV2[1.2]. The target system is displayed in colvunn 1202. Column 1204 indicates how many 
DLLs are the same on the source and target servers and column 1206 indicates how many 
common DLLs are different. A common DLL is one that is used by all applications in the 
system, e.g., by being located in the Windows System32 directory. Column 1208 indicates the 
target load percentage prior to consolidation and column 1210 indicates the target load 
percentage after consolidation. CPU utiUzation values from the source server are normalized to 
the processing power of the target server. Similarly columns 1214 and 1216 display the impact 
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on the memory of the target machine. Memory load values from the source server are 
normalized to the size of the memory on the target server. This display allows a user to quickly 
determine if the consolidation of the source server to the target server keeps the target server 
within utilization targets and also provides an indication of how many additional DLLs will need 
to be loaded onto the target server to support the applications moved from the source, server. 

[0101] In addition to system compatibility, process compatibility is an important 
consideration in detemiining which servers to consolidate. When the Process compatibility 
detail choice 1106 is made in pane 1100 of Figure 11, the source system processes list box 1112 
is enabled, and the user chooses one or more of the processes. The user then chooses a single 
target server from the Target Systems list box 1 1 10. Figure 13 provides a UI that displays the 
result of the process analysis and assists a user in determining process compatibility. Window 
1300 displays a comparison of common DLL compatibility and differences on the source and 
target server. Column 1302 displays the common DLL name, column 1204 displays its version 
and column 1306 indicates whether that colunm is present ("1') or absent ("0") on the target 
server. Moreover, even if the DLL is present on the target server, column 1308 provides and 
indication of whether the versions on the source and the target are the same ("1") or different 
("0")- When the version of the DLL on the target system is different, colunm 1310 contains the 
version tliat was found on the target system. As is illustrated in here, many of the DLLs on the 
source are also present on the target server; however, the target version does not match the 
source version. Colunons 1304 and 1310 provide the version of the source and the target DLL 
versions, respectively. In this way, a user can quickly determine whether the target version is a 
newer version of the DLL, perhaps alleviating the need to update. 

[0102] Figures 14 and 15 provide many of the same analysis tools as those provided 
above in the context of database consolidation. In addition to consolidating applications and 
processes on servers, database consolidation is also an important aspect of consolidation. 
Database consolidation requires an understanding of how database schemas vary among 
databases or database instances on various servers. More particularly, database consolidation 
may be available by the recognition that multiple database, while not identical, may have enough 
information in common that can be combined. This commonality requires, at least initially, that 
the target database have all of the columns in the source database or a sufficient number of 
columns of the source database and the ability to add columns and or table from the source 
database. Thereafter, addition needs can be addressed such as moving triggers, stored 
procedures, alerts and the like to the target database. 
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[0103] Figure 14 provides a high level view of the common SQL server logins. In this 
example, windov^^ 1400 is divided into two panes 1402 and 1404. Pane 1402 provides a Usting of 
database inventory that was collected for the servers during discovery as indicated above. Pane 
1404 Ust all of the common SQL Logins that were discovered on the multiple in the server farm, 
e.g., 110. Column 1406 provides the login name for the databases. Colunm 1408 provides the 
instance name. Hence a user can easily determine which databases with common login names 
are on which servers. 

[0104] When the database Compatibility details choice 1114 is made in pane 1100 of 
Figure 11, the user can perform database compatibiUty analysis. Figure 15 provides additional 
information necessary to analyze database compatibility. In this example, window 1500 
provides two panes 1502 and 1504. Pane 1502 is identical to pane 1402. Pane 1504 provides a 
listing of table and column names and provides an indication of schema commonality and 
differences. Column 1508 provides a Usting of table names and columns names for the tables in 
question. Column 1 506 provide a item type that identifies whether the item Usted in column 
1508 is a database table or database column. Column 1510 provides an indication whether the 
item in column 1508 is present on ("1") or absent from ("0") the target server. Column 1512 
provides an indication whether the items on the source and the target are compatible ("1"), 
incompatible ("0"), or whether that cannot be determined ("???"). 

[0105] Figure 16A and 16B provide further details on the implementation of the analysis 
tools described above. In particular, the selected XML files for the selected system and database 
inventory are loaded into database 206 (See Figure 2). SQL queries are then run against the data 
in the database to perform the analysis, i.e., to compare inventory in one server with the 
inventory in another server. Figure 16A provides a high level view of a schema 206a that could 
be used to store the coUected XML data. The schema iUustrates the kind of tables that could be 
used. The XML data could be loaded in the SQL database according to know techniques such as 
XML Bulk Load or other SQLXML commands. 

[0106] Preferably, a more flexible approach would be used. In such an implementation, 
an XML loader uses Microsoft XMLParser to parse the XML contents into datasets. The 
datasets are then used to build relational records and stored into a relational database, e.g., 
database 206. 

[0107] Schema 206a contains Sysinfo table 1602 which contains information such as the 
system name, make, and model number, system memory information, as well as information 
about tiie source of tiie data, i.e., which XML file and version number. Hardwarelnfo table 1604 
contains server hardware information such as number of processors and available processors. 
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Network table 1608 contains a variety of network infonnation such as NIC identifiers, IP 
addresses, and so on. Device table* 1610 contains information on hardware devices such as 
device names. Drive table 1606 contains server drive information such as total byte storage, 
bytes free, volume name, and so on. Application table 1612 contains information such as 
application name and version number. Process table 1614 contains information on processes 
such as process owner, cpu utilization information, memory utilization information, and so on. 
Module table 1618 contains module information such as module size, module name, and so on. 
Process Module Association table 1616 associates modules with parent processes. 

[0108] Schema 206a is useful in perfonning system inventory analysis for such things as 
application consolidation. With respect to database analysis. Figure 16B illustrates an high level 
schema for use with the database inventory XML files. As such, selected database XML files 
that were discovered from the various servers as described above are loaded into database 206 in 
accordance with schema 206b. Server table 1620 keeps the information identifying which 
server maintains the discovered database. Instance table 1622 keeps information on the names of 
one or more instances of database servers installed on the server, e.g., SQL Server 6.0 and SQL 
Server 7.0. For each instance, database table 1624 contains information on one or more 
databases within that instance. For each database in table 1624, Table table 1626 has all of the 
table names and Column table 1628 maintains all of the columns for a given table. Procedure 
table 1632 maintains information such as the names of stored procedures used in a database. 
Function table 1636 maintains a list of function names associated with a database. Trigger table 
1640 maintains a list of trigger names associated with a database. DBRole table 1644 maintains 
a list of database roles associated with a database. Additionally, for each instance in Instance 
table 1622, DTSPackage table rhaintains information related to the data transformation services 
packages associated with that database such as the name of the package and the owner. Login 
table 1638 maintains login infonnation such as user name. Finally, Server Role table 1642 
maintains information related to the server role such as member name and member SID. 

[0109] After the analysis has been completed and consolidation candidates have been 
identified, there may be a significant number of files that have to moved and/or loaded on the 
target server. Figures 17 and 18 illustrate aspects of the subject system that assist in automating 
at least aspects of the deployment of the new assets to a target server. Figure 17 provides an 
example asset deployment UI. Window 1700 has drop down box 1702 wherein deployment tool 
has been selected. Select box 1708 provides a mechanism for a user to identify a target server to 
which assets are to be deployed. Fane 1706 identifies all of the various assets to be deployed on 
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the target server. Notably, box 1704 provides a user with the capabiHty to define deployment 
rules to be used in association with the deployment of assets on the target server. 

[0110] After a user has determined that deployment rules should be used, selecting 
define button 1705 causes a rules editor to launch. Figure 18 further illustrates the rules editor. 
Window 1800 provides an example listing of predefined rules templates including the following 
template: 

Check for minimum disk space on a drive; 

Check for minimum memory (RAM); 

Check for minimum number of processors; 

Check if a copy of this application is already installed; 

Make sure that a conflicting application is NOT installed; 

Make sure that a required application is already installed. 

Of course other rule templates could be defined without departing from the scope of this aspect 
of the subject system. 

[Dili] Figure 19 further illustrates aspects of the deployment system. Here, 
consolidation information has been collected and analyzed, as described herein above. 
Thereafter, the consolidated server farm 120 is to be deployed. To that end, all of the 
executables, binaries, and essentially aU of the files necessary to perform an installation are 
placed into a folder with a setup file. TypicaDy this wiU be a single appUcation per folder but 
need not be so limited. Additionally, the templates are selected for the deployment. For 
example, if TniniTnnTn memory is selected, then a user vnH define the minimum memory 
requirements, e.g., 512 MB. Similarly parameters are defined for other selected templates, e.g., 
2 processors, 1 gigabytes of disk space, and so on. At some point, the target servers are selected 
for deployment. As illustrated in Figure 19, servers 120a and 12qb were selected. Alternatively 
an entire domain may be selected. As described above in connection with the discovery aspects 
of the system, the assets of the target systems are discovered. This could have been performed a! 
part of the initial consolidation process or could be performed independently. 

[0112] The relevant XML files cont*uning the discovered information is then parsed and 
compared to the defined rules. If the rules pass, the files are transmitted to the target server or 
servers and the installation and a remote procedure call is made to start the installation. 
Preferably, the transmitted instaU files are compressed before transmitting and decompressed on 
the target. Preferably the compression is performed by ZIPPING the configuration files before 
transmission and unZIPPING the configuration folders at the target server. The unzip program 
may be sent as part of the process, for example, by bundling the unzip program as a self 
extracting file. 
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[0113] Preferably, the testing of the defined rales is performed by an XPATH query 
against the XML file. For example, using the example XML file defined above in connection 
with the discovery, an XPATH query for the number of processors would return a "2" if applied 
against the below XML excerpt: 

<PE_HdweInfo ID="HI" parent„ID="5008DJUL1030-SI" numberOfProcessors="2' 

availableProcessorMask="3" processorLevel="15" processorRevision="521"> 
<PE_ProcessorSpeed parent_ID="HI" procNum="0" speed="2992" /> 
<PE_ProcessorSpeed parent_ID="HI" procNum="l" speed = "2992" /> 

<PE_Device parent_ID="Hr devlceLocation="LPTl" cmpLocatlon=""> 
<deviceName>Prlnter Port Logical Interface</devlceName> 
</PE_Device> 

<PE_Devlce parent_ID="HI" deviceLocation="USB Device" cmpLocatlon=""> 
<devlceName>ViewSonic Color Pocket PC V37</devlceName> 
</PE_Device> 

<PE_Device parent_ID="Hr" deviceLocation="" cmpLocation="0,0,0,l"> 
<deviceName>HL-DT-ST RW/DVD GCC-4480B</devlceName> 
</PE_Devlce> 

[0114] Similar XPATH queries could be applied for other rule values. 

[0115] The above deployment may be used in contexts other than the consoUdation 
context. For example, a company may want to deploy an application across a number of client 
machines throughout its organization. The above technique would allow a single deployment 
setup to automatically install the applications on the selected machines that meet the defined 
rules. 

[0116] The above consolidation in an example description only and is not intended to 
indicate that applications and databases are consolidated in all server consolidations. Rather, the 
example is intended to indicate the breath of consolidation that may be possible;' The 
overarching theme is that consolidation 115 provides the tools to determine the inventory of 
hardware, software, and data on a server farm such as server farm 1 10 and simplify the 
consolidation of that hardware, software and data. 

[0117] Elements of embodiments of the invention described below may be implemented 
by hardware, firmware, software or any combination thereof. The term hardware generally 
refers to an element having a physical structure such as electronic, electromagnetic, optical, 
electro-optical, mechanical, electro-mechanical parts, while the term software generally refers to 
a logical structure, a method, a procedure, a program, a routme, a process, an algorithm, a 
formula, a function, an expression, and the like. The term firmware generally refers to a logical 
stmcture, a method, a procedure, a program, a routine, a process, an algorithm, a formula, a 
function, an expression, and the like that is impleniented or embodied in a hardware structure 
(e.g., flash memory, ROM, EROM). Examples of firmware may include microcode, writable 
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control store, and micro-programmed structure. When implemented in software or firmware, the 
elements of an embodiment of the present invention are essentially the code segments to perform 
the necessary tasks. The software/firmware may include the actual code to carry out the 
operations described in one embodiment of the invention, or code that emulates or simulates the 
operations. The program or code segments can be stored in a processor or machine accessible 
medium or transmitted by a computer data signal embodied in a carrier wave, or a signal 
modulated by a cairier, over a transmission medium. The "processor readable or accessible 
medium" or "machine readable or accessible medium" may include any medium that can store, 
transmit, or transfer information. Examples of the processor readable or machine accessible 
medium include an electronic circuit, a semiconductor memory device, a read only memory 
(ROM), a flash memory, an erasable ROM (EROM), a floppy diskette, a compact disk (CD) 
ROM, an optical disk, a hard disk, a fiber optic medium, a radio frequency (RF) link, and the like. 
The computer data signal may include any signal that can propagate over a transmission medium 
such as electronic network channels, optical fibers, air, electromagnetic, RF links, etc. The code 
segments may be downloaded via computer networks such as the Internet, Intranet, etc. The 
machine accessible medium may be embodied in an article of manufacture. The machine 
accessible medium may include data that, when accessed by a machine, cause the machine to 
perform the operations described in the foUowing. The machine accessible medium may also 
include program code embedded therein. The program code may include machine readable code 
to perform the operations described in the following. The term "data" here refers to any type of 
information that is encoded for machine-readable purposes. Therefore, it may include programs, 

code, data, files, and the like. 

[0118] All or part of an embodiment of the invention may be implemented by hardware, 
software, or firmware, or any combination thereof. The hardware, software, or firmware element 
may have several modules coupled to one another. A hardware module is coupled to another 
module by mechanical, electrical, optical, electromagnetic or any physical connections. A software 
module is coupled to another module by a function, procedure, method, subprogram, or 
subroutine call, a jump, a link, a parameter, variable, and argument passing, a fimction return, 
and the like. A software module is coupled to another module to receive variables, parameters, 
arguments, pointers, etc. and/or to generate or pass results, updated variables, pointers, and the 
like. A firmware module is coupled to another module by any combination of hardware and 
software coupUng methods above. A hardware, software, or firmware module may be coupled to 
any one of another hardware, software, or firmwaig module. A module may also be a software 
driver or interface to interact with the operating system running on the platform. A module may 
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also be a hardware driver to configure, set up, initialize, send and receive data to and from a 
hardware device. An apparatus may include any combination of hardware, software, and 
firmware modules. 

[0119] Embodiments of the invention may be described as a process which is usually 
depicted as a flowchart, a flow diagram, a structure diagram, or a block diagram. Although a 
flowchart may describe the operations as a sequential process, many of the operations can be 
performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. 

A process is terminated when its operations are completed, 

[0120] Those skilled in the art also will readily appreciate that many additional 
modifications are possible in the exemplary embodiment without materially departing from the 
novel teachings and advantages of the invention. Any such modifications are intended to be 
included within the scope of this invention as defined by the following exemplary claims. 
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What is Claimed: 

1. A method for use in consolidating computing devices, comprising: 

storing in at least two data set files containing information indicative of the 
characteristics of at least a first computing device wherein the data sets describe the information 

in a markup language; 

loading the at least two data sets into a first relational database so that the at least two 

data sets can be compared to each other. 

2. The method as recited in claim 1 wherein one of the at least two data sets contains 
information indicative of a second computing device. 

3. The method as recited in claim 1 wherein the markup language comprises XML. 

4. The method as recited in claim 1 wherein the infonnation indicative of the characteristics of a 
computing device comprises information indicative of system parameters. 

5. The method as recited in claim 4 wherein the system parameters comprise at least one of: the 
number of processors, available processors, processor level, devices, disk drive characteristics, 
disk drive capacity, system name, page size, operating system version, operating system build, 
and network connectivity. 

6. The method as recited in claim 1 wherein the information indicative of the characteristics of a 
computing device comprises information indicative of executable process parameters. 

7. The method as recited in claim 6 wherein the executable process parameters comprise at least 
one of: CPU utiUzation, memory utilization, active processes, active process dependencies, 
processor usage, memory usage, process creation time, process ID, process owner, process 
handles, process version, dependency version, process timestamp, process description, and 
dependency description. 

8. The method as recited in claim 7 wherein the first relational database comprises a system 
information table for maintaining the system information for at least one computing device. 

9. The method as recited in claim 8 wherein the first relational database comprises a process 
table related to the system information table, wherein the process table maintains information 
related to executable processes on a computing device. 
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10. The method as recited in claim 9 wherein the first relational database comprises a module 
table related to the system information table, wherein the module table maintains information 
related to modules on a computing device that are used by a process. 

11. The method as recited in claim 1 wherein the information indicative of the characteristics of 
a computing device comprises information indicative of computing device database definition 
parameters. 

12. The method as recited in claim 11 wherein the computing device database definition 
parameters comprise at least one of: database names, roles, users, aliases, defaults, rules, 
functions, user defined datatypes, user messages, tables, views, indexes, extended procedures, 
stored procedures, and triggers. 

13. The method as recited in claim 12 wherein the first relational database comprises a database 
name table for maintaining the names of computing device database names. 

14. The method as recited in claim 13 wherein the first relational database comprises a table 
table related to the database name table, wherein the table table maintains computing device 
database table names. , 

15. The method as recited in claim 14 wherein the first relational database comprises a column 
table related to the table table, wherein the column table maintains computing device database 
column names. 

16. The method as recited in claim 13 comprising a second table related to the database name 
table wherein.the second table is a table comprising at least one of: trigger, procedure, database 
role, function, and procedure. 
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17. A system for comparing computing device parameters, comprising: 

a storage device storing at least two data set files containing information indicative of the 
characteristics of at least a first computing device wherein the data set files describe the 
information in a markup language; 

a first relational database having tables configured to accept data from the data set files; 

and 

a set of computer-readable instruction capable of loading the data from the at least two 
data sets into the tables of the first relational database so that the at least two data sets can be 
compared to each other. 

1 8. The system as recited in claim 17 wherein one of the at least two data sets contains 
information indicative of a second computing device. 

19. The system as recited in claim 17 wherein the markup language comprises XML. 

20. The system as recited in claim 17 wherein the information indicative of the characteristics of 
a computing device comprises inforaiation indicative of system parameters. 

21. The system as recited in claim 20 wherein the system parameters comprise at least one of: 
the number of processors, available processors, processor level, devices, disk drive 
characteristics, disk drive capacity, system name, page size, operating system version, operating 
system build, and network connectivity. 

22. The system as recited in claim 17 wherein the infonnation indicative of the characteristics of 
a computing device comprises infonnation indicative of executable process parameters. 

23. The system as recited in claim 22 wherein the executable process parameters comprise at 
least one of: CPU utilization, memory utilization, active processes, active process dependencies, 
processor usage, memory usage, process creation time, process ID, process owner, process 
handles, process version, dependency version, process timestamp, process description, and 
dependency description. 

24. The system as recited in claim 23 wherein the first relational database comprises a system 
information table for maintaining the system infonnation for at least one computing device. 

25. The system as recited in claim 24 wherein the first relational database comprises a process 
table related to the system infonnation table, wherein the process table maintains information 

related to executable processes on a computing device. 
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26. The system as recited in claim 25 wherein the first relational database comprises a module 
table related to the system information table, wherein the module table maintains information 
related to modules on a computing device that are used by a process. 

27. The method as recited in claim 17 wherein the infomiation indicative of the characteristics 
of a computing device comprises information indicative of computing device database definition 
parameters. 

28. The system as recited in claim 27 wherein the computing device database definition 
parameters comprise at least one of: database names, roles, users, aliases, defaults, rules, 
functions, user defined datatypes, user messages, tables, views, indexes, extended procedures, 
stored procedures, and triggers. 

29- The system as recited in claim 28 wherein the first relational database coriiprises a database 
name table for maintaining the names of computing device database names. 

30. The system as recited in claim 29 wherein the first relational database comprises a table 
table related to the database name table, v^herein the table table maintains computing device 
database table names. " , 

31. The system as recited in claim 30 wherein the first relational database comprises a colunrn 
table related to the table table, wherein the colunrn table maintains computing device database 
column names. 

32. The system as recited in claim 29 comprising a second table related to the database name 
table wherein the second table is a table comprising at least one of: trigger, procedure, database 
role, function, and procedure. 
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